Bilingual Indexing for Information Retrieval with AUTINDEX
Abstract
AUTINDEX is a bilingual automatic indexing system for German and English. It is being developed within the EU-funded BINDEX project. The aim of the system is to automatically index large quantities of abstracts of scientific and technical papers from several areas of engineering. Automatic indexing uses a controlled vocabulary provided in monolingual and bilingual thesauri. For a given abstract, AUTINDEX uses these thesauri to produce a list of descriptors as well as a list of classification codes. It also allows for free indexing, that is, indexing with an unrestricted vocabulary (delivering so-called 'free descriptors'). These free descriptors are used to enhance and extend the thesauri. The bilingual AUTINDEX module indexes German abstracts in English and ...
Similar resources
Automatic Multilingual Indexing and Classification
Most of today's published scientific and technical articles are written in English. Therefore, the number of English documents being collected by information brokers such as bibliographic database producers, libraries and publishers increases rapidly. However, there will still be a number of documents only available in the native language of the author. One method to facilitate access to this i...
Automatic Multilingual Indexing and Natural Language Processing
The number of documents being collected by information brokers such as bibliographic database producers, libraries and publishers increases rapidly. The consequence is a huge demand for indexing and classification. So far this has had to be carried out manually. The system AUTINDEX, which is described in this paper, offers tools for monolingual as well as for multilingual automatic indexing and ...
University of Hagen at CLEF 2004: Indexing and Translating Concepts for the GIRT Task
This paper describes the work done at the University of Hagen for our participation in the German Indexing and Retrieval Test (GIRT) task of the CLEF 2004 evaluation campaign. We conducted both monolingual and bilingual information retrieval experiments. For monolingual experiments with the German document collection, the focus is on applying and comparing three indexing methods targeting full ...
NCU in Bilingual Information Retrieval Experiments at NTCIR-6
In this paper, we present our monolingual and bilingual ad-hoc information retrieval experimental results at NTCIR-6. This year we compare two word tokenization levels for indexing, namely unigram and overlapping bigram. Two well-known information retrieval models, the language model and BM-25, were adopted in our study. The monolingual results show that our method achieved t...
Content Based Radiographic Images Indexing and Retrieval Using Pattern Orientation Histogram
Introduction: Content Based Image Retrieval (CBIR) is a method of image searching and retrieval in a database. In medical applications, CBIR is a tool used by physicians to compare previous and current medical images associated with patients' pathological conditions. As the volume of pictorial information stored in medical image databases grows, efficient image indexing and retri...